Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
Hetta | 3029 | 88 | 1 | 88.0000 |
Í | 5356 | 253 | 4 | 63.2500 |
Men | 1616 | 63 | 1 | 63.0000 |
Og | 1111 | 44 | 1 | 44.0000 |
Ein | 1347 | 77 | 2 | 38.5000 |
Sambært | 771 | 33 | 1 | 33.0000 |
Tey | 592 | 32 | 1 | 32.0000 |
Tú | 683 | 32 | 1 | 32.0000 |
men | 4404 | 124 | 4 | 31.0000 |
ið | 9439 | 323 | 11 | 29.3636 |
So | 438 | 29 | 1 | 29.0000 |
Tað | 4856 | 104 | 4 | 26.0000 |
Við | 1140 | 48 | 2 | 24.0000 |
Sum | 824 | 47 | 2 | 23.5000 |
Ásetingin | 329 | 21 | 1 | 21.0000 |
Henda | 616 | 41 | 2 | 20.5000 |
Verður | 347 | 20 | 1 | 20.0000 |
Uppskotið | 398 | 19 | 1 | 19.0000 |
Hon | 290 | 19 | 1 | 19.0000 |
Á | 1009 | 56 | 3 | 18.6667 |

Word | Frequency | Number of right neighbors | Number of left neighbors | Ratio |
---|---|---|---|---|
kr | 2136 | 1 | 68 | 0.0147 |
nr | 2069 | 1 | 38 | 0.0263 |
m.a | 856 | 1 | 30 | 0.0333 |
mió | 1355 | 5 | 100 | 0.0500 |
t.d | 1109 | 1 | 20 | 0.0500 |
partur | 990 | 2 | 40 | 0.0500 |
uml | 397 | 1 | 18 | 0.0556 |
stk | 3782 | 1 | 17 | 0.0588 |
v.m | 553 | 2 | 32 | 0.0625 |
part | 388 | 1 | 14 | 0.0714 |
tús | 211 | 1 | 13 | 0.0769 |
mió.kr | 209 | 1 | 13 | 0.0769 |
kl | 511 | 3 | 36 | 0.0833 |
fyrst | 499 | 2 | 23 | 0.0870 |
grundað | 178 | 1 | 11 | 0.0909 |
langa | 101 | 1 | 11 | 0.0909 |
tkr | 149 | 1 | 11 | 0.0909 |
innlit | 190 | 1 | 11 | 0.0909 |
møguleika | 491 | 3 | 31 | 0.0968 |
mongu | 78 | 1 | 10 | 0.1000 |
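In this subsection, we compute, for each word, the ratio of the number of right neighbors to the number of left neighbors. Again, we look for words with extreme ratios: the first table above lists the words with the highest ratios, the second the words with the lowest.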
Query for the first table (the 20 highest ratios):
select word,w.freq,aa.cnt, bb.cnt,aa.cnt/bb.cnt as r from words w, (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where w_id=aa.w1_id and aa.w1_id=bb.w2_id order by r desc limit 20;
Query for the diagram data (right and left neighbor counts per word):
select aa.cnt, bb.cnt from (select w1_id,count(c.w2_id) as cnt from co_n c where w1_id>100 group by w1_id) aa, (select w2_id,count(c.w1_id) as cnt from co_n c where w2_id>100 group by w2_id) bb where aa.w1_id=bb.w2_id;
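The ratio computation can be tried on a small in-memory database. The following is a minimal sketch, not the pipeline used for the tables above: it assumes that `words` has the columns `w_id`, `word`, `freq` and that `co_n` holds one row per distinct neighbor pair, with `w1_id` the left word and `w2_id` the right word, as the queries suggest. The helper names `build_toy_db` and `neighbor_ratios` and the toy rows are hypothetical. SQLite stands in for the original database, so the ratio is multiplied by `1.0` to force floating-point division.

```python
import sqlite3


def build_toy_db():
    """Create an in-memory database with the assumed schema and toy data."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)"
    )
    conn.execute("CREATE TABLE co_n (w1_id INTEGER, w2_id INTEGER)")
    conn.executemany(
        "INSERT INTO words VALUES (?, ?, ?)",
        [(101, "Hetta", 30), (102, "kr", 20), (103, "er", 50),
         (104, "hús", 5), (105, "tíggju", 4)],
    )
    # Each row is one (left word, right word) neighbor pair; toy values only.
    conn.executemany(
        "INSERT INTO co_n VALUES (?, ?)",
        [(101, 103), (101, 104), (101, 105),  # "Hetta" has 3 right neighbors
         (103, 101),                          # "Hetta" has 1 left neighbor
         (102, 103),                          # "kr" has 1 right neighbor
         (104, 102), (105, 102)],             # "kr" has 2 left neighbors
    )
    return conn


def neighbor_ratios(conn, min_id=100):
    """Return (word, freq, right, left, ratio) rows, highest ratio first.

    Words with no right or no left neighbors are dropped by the inner
    joins, so the division never sees a zero denominator.
    """
    query = """
        SELECT w.word, w.freq, aa.cnt, bb.cnt, 1.0 * aa.cnt / bb.cnt AS r
        FROM words w
        JOIN (SELECT w1_id, COUNT(w2_id) AS cnt FROM co_n
              WHERE w1_id > ? GROUP BY w1_id) aa ON w.w_id = aa.w1_id
        JOIN (SELECT w2_id, COUNT(w1_id) AS cnt FROM co_n
              WHERE w2_id > ? GROUP BY w2_id) bb ON aa.w1_id = bb.w2_id
        ORDER BY r DESC
    """
    return conn.execute(query, (min_id, min_id)).fetchall()


if __name__ == "__main__":
    for row in neighbor_ratios(build_toy_db()):
        print(row)
```

In this toy data, "Hetta" behaves like a sentence-initial word (many right neighbors, few left neighbors) and "kr" like a unit that follows numbers, which mirrors the two ends of the tables.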